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the claims : 



]\. (Currently amended) A method comprising: 
segmenting video data to create a video clip based on 
timing data tnkt indicates a specified timing within a gesture 
will occur ; and 

determining ^formation related to a a moot likely gesture 
occurring in the video clip only at the specified timing . 




2. (Currently amencted) The method of claim 1, wherein 
determining includes dete Wining a probability that each of a 
plurality of predefined gesVures which are performed in the 
video clip contains the predefined gesture. 

3. (Original) The method oA claim 2, wherein determining 
the probability that the video clip, contains each of the 
predefined gesture includes evaluations of Hidden Markov Models, 



4. (Original) The method of claim 1, wherein the timing 
data includes beat data corresponding to a beat of audio data* 



5. (Original) The method^f claim 4, further comprising: 
receiving the audio data; ^nd 

extracting the beat data frA>m the audio data, 
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. • a i\ The method of 
€. (Original)\The correspo nding to a 

„ nr ^on of the video data cox 
clip includes a portion 

• surrounding the occurrence of 
predefined time windoV\sur 

one beat . 

* , rlmim 1, fM"her comprising 
(Original) B» «« tho4 of =1 
\ . to be performed by the m*l** of 

d i 3P laX* «r 3 et gesture to be P 

the video O^ta* 

*-^-in« videos frames, 
clip contains via«<* 



, 0 of I- X. further coding 

identify^ — W regio^m each 

clip- 

\ * i a~i m 9 further comprising 
10 . (Original) The method\f ci al m 9 . ^ 

• feature vector for \ch video frame of the 
generating a feature v \ 

clip- 

fc . ftf cla \ 1, further comprising 
U (Original) The method of claimi, 

wher theVdeo clip contains the 
a score based on whether the ^ 
generating a score \ 

target gesture. 
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(Original) The method of claim 11 , further comprising 
displaying the score. 



13. (Ora^inal) The method of claim 1, wherein determining 
if the video cMp contains the predefined gesture includes 



\ 



generating a gesbure probability vector having a plurality of 



\ 



elements, each element being associated with one of a plurality 



\ 



of predefined gestures and representing a probability that the 



\ 




video clip contains each of the associated predefined gestures. 

14. (Currently amended) A system comprising: 
a temporal segmentor connected to receive video data and to 
create a video clip from the Video data based on timing data 
that indicates a specified timing within which a gesture will 
occur ; and 

a recognition engine, in communication with the temporal 
segmentor, to determine if the video clip contains a predefined 
gesture , only at the specified timinc 



15. (Original) The system of claim \l4, wherein the 
recognition engine includes a plurality o£ Hidden Markov Models. 



16. (Original) The system of claim 14, further comprising: 
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a\timing data source, in communication with the temporal 
segmentor^ to provide the timing data to the temporal segmentor; 
and 

a video ^source, in communication with the temporal 
segmentor, to provide the video data to the temporal segmentor. 




17. (Original K^The system of claim 14, further comprising a 
move subsystem, in co^nmiinication with the timing data source, to 
provide a target gesture to be performed by the subject of the 
video data. 

18. (Original) The system of claim 17, wherein the target 
gesture is a dance move that ^s to be performed by the subject 
of the video data. 



19, (Original) The system ofVrlaim 17, further comprising a 
scoring subsystem, in communi cat ionV with the recognition engine 
and the move subsystem, to determine \if the video clip contains 
the target gesture . 



20. (Original) The system of claim ia, further comprising a 
display subsystem, in communication with th^ scoring subsystem, 
to display a score that is a function of whe^ier the video clip 
contains the target gesture . 
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2 lYv (Original) The system of claim 20, wherein the display 
subsystem re in communication with the move subsystem and is 
configured toN^lisplay a gesture request based on the target 
gesture. 




22. (Original) The system of claim 14, wherein the 
recognition engine is configured to recognize predefined 
gestures and to produce a\jesture probability vector having 
elements, each element being, associated with one of the 
predefined gestures and representing the probability that the 
video clip contains the associated predefined gesture. 



23 /\(Original) The system of claim 14, wherein the timing 

data source includes: 

an audio sotaxce that provides an audio data; and 

a beat extractor, in communication with the audio source, 

that extracts beat dat\ from the audio data. 



24. (Original) The system of claim 23, wherein the video 
clip corresponds to a beat in the beat data. 
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25. (Origina^J^he system of claim 24, wherein the video 
clip includes a pojSfcipn of the video data corresponding to a 
predefined time wmddjv surrounding the occurrence of the beat, 



2e\, (Currently amended) A computer program product, 
tangibly ^tored on a computer- readable medium, for recognizing 
gestures contained in video data, comprising instructions 
operable to c&jise a programmable processor to: 

segment th^ video data to create a video clip based on 
timing data that Vdicates a specified timing within which a 
gesture will occur ; Nand 

determine if the Video clip contains a predefined gesture 
within the specified timing . 

2\. (Original) The product of claim 26, further comprising 
instructions operable to cause the programmable processor to; 

extraOt beat data from an audio signal; and 
segment the video data to create the video clip using the beat 
data. 



28. (Currently amended) An audio-visual processing system 
including: 

a video source to provide video data; 
an audio source to provide audio data; 
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speaker to play at least a portion of the audio data; and 
a computer program product, tangibly stored on a computer- 
readable medium, for recognizing gestures contained in video 
data, comprising instructions operable to cause a programmable 
processor, in\:ommunication with the yideo source and the audio 
source , to: 

extract beat J^ata from the audio data; 

segment the vid^p data to create a video clip based on said 
beat data; and 

determine if the vi&eo clip contains a predefined gesture 
within only a specified tiding related to said beat data. 



29, (Original) The video processing system of claim 28, 
wherein the computer program product further includes 
instructions operable to cause tha programmable processor to: 

perform a Hidden Markov Model Vrocess to determine if the 
video clip contains the predefined g^teture. 



30* (Original) The video processing\system of claim 28, 
further comprising a display to display information based on 
whether the video clip contains the predefined gesture. 
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